AITopics

Industry: Law (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing Systems, Feb-9-2026, 18:42:22 GMT

72372ec86dd49238900fc0b68bad63f8-Paper-Datasets_and_Benchmarks.pdf

Testifying to their utility in accurately representing abstractions, completion and extrapolation tasks on integer sequences are a frequent part of general human intelligence and aptitude testing ([42, 31]).

artificial intelligence, machine learning, sequence, (17 more...)

Country: Europe > Switzerland (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.88)

Neural Information Processing Systems, Aug-15-2025, 20:41:26 GMT

72372ec86dd49238900fc0b68bad63f8-Supplemental-Datasets_and_Benchmarks.pdf

artificial intelligence, machine learning, natural language, (19 more...)

Industry:

Law (0.67)
Information Technology (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Neural Information Processing Systems, Aug-15-2025, 20:41:23 GMT

72372ec86dd49238900fc0b68bad63f8-Paper-Datasets_and_Benchmarks.pdf

artificial intelligence, machine learning, natural language, (16 more...)

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(2 more...)

Abaskohi, Amirhossein, Gella, Spandana, Carenini, Giuseppe, Laradji, Issam H.

FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering

arXiv.org Artificial Intelligence, Dec-17-2024

Multimodal multihop question answering is a complex task that requires reasoning over multiple sources of information, such as images and text, to answer questions. While there has been significant progress in visual question answering, the multihop setting remains unexplored due to the lack of high-quality datasets. Current methods focus on single-hop question answering or a single modality, which makes them unsuitable for real-world scenarios such as analyzing multimodal educational materials, summarizing lengthy academic articles, or interpreting scientific studies that combine charts, images, and text. To address this gap, we propose a novel methodology, introducing the first framework for creating a high-quality dataset that enables training models for multimodal multihop question answering. Our approach consists of a 5-stage pipeline that involves acquiring relevant multimodal documents from Wikipedia, synthetically generating high-level questions and answers, and validating them through rigorous criteria to ensure quality data. We evaluate our methodology by training models on our synthesized dataset and testing on two benchmarks. Our results demonstrate that, with an equal sample size, models trained on our synthesized data outperform those trained on human-collected data by 1.9 in exact match (EM) on average. We believe our data synthesis method will serve as a strong foundation for training and evaluating multimodal multihop question answering models.

information, natural language, question answering, (19 more...)

2412.0703

Country:

North America > Canada > Quebec > Capitale-Nationale Region > Québec (0.05)
North America > Canada > Quebec > Capitale-Nationale Region > Quebec City (0.05)
Asia > China (0.04)
(10 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Military (1.00)
Leisure & Entertainment (0.93)
Government > Regional Government > North America Government > United States Government (0.93)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

arXiv.org Artificial Intelligence, Dec-15-2024

AD-LLM: Benchmarking Large Language Models for Anomaly Detection

Yang, Tiankai, Nian, Yi, Li, Shawn, Xu, Ruiyao, Li, Yuangang, Li, Jiaqi, Xiao, Zhuo, Hu, Xiyang, Rossi, Ryan, Ding, Kaize, Hu, Xia, Zhao, Yue

Anomaly detection (AD) is an important machine learning task with many real-world uses, including fraud detection, medical diagnosis, and industrial monitoring. Within natural language processing (NLP), AD helps detect issues like spam, misinformation, and unusual user activity. Although large language models (LLMs) have had a strong impact on tasks such as text generation and summarization, their potential in AD has not been studied enough. This paper introduces AD-LLM, the first benchmark that evaluates how LLMs can help with NLP anomaly detection. We examine three key tasks: (i) zero-shot detection, using LLMs' pre-trained knowledge to perform AD without task-specific training; (ii) data augmentation, generating synthetic data and category descriptions to improve AD models; and (iii) model selection, using LLMs to suggest unsupervised AD models. Through experiments with different datasets, we find that LLMs can work well in zero-shot AD, that carefully designed augmentation methods are useful, and that explaining model selection for specific datasets remains challenging. Based on these results, we outline six future research directions on LLMs for AD.

category, large language model, machine learning, (19 more...)

2412.11142

Country: North America > United States (0.92)

Genre: Research Report > New Finding (0.93)

Industry:

Information Technology (0.67)
Health & Medicine (0.66)
Law Enforcement & Public Safety > Fraud (0.48)
Energy > Oil & Gas (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Chalcroft, Liam, Pappas, Ioannis, Price, Cathy J., Ashburner, John

Synthetic Data for Robust Stroke Segmentation

arXiv.org Artificial Intelligence, Apr-2-2024

Deep learning-based semantic segmentation in neuroimaging currently requires high-resolution scans and extensive annotated datasets, posing significant barriers to clinical applicability. We present a novel synthetic framework for the task of lesion segmentation, extending the capabilities of the established SynthSeg approach to accommodate large heterogeneous pathologies with lesion-specific augmentation strategies. Our method trains deep learning models, demonstrated here with the UNet architecture, using label maps derived from healthy and stroke datasets, facilitating the segmentation of both healthy tissue and pathological lesions without sequence-specific training data. Evaluated against in-domain and out-of-domain (OOD) datasets, our framework demonstrates robust performance, rivaling current methods within the training domain and significantly outperforming them on OOD data. This contribution holds promise for advancing medical imaging analysis in clinical settings, especially for stroke pathology, by enabling reliable segmentation across varied imaging sequences with reduced dependency on large annotated corpora.

dataset, lesion, segmentation, (15 more...)

2404.01946

Genre: Research Report > Experimental Study (0.31)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Khosravi, Bardia, Li, Frank, Dapamede, Theo, Rouzrokh, Pouria, Gamble, Cooper U., Trivedi, Hari M., Wyles, Cody C., Sellergren, Andrew B., Purkayastha, Saptarshi, Erickson, Bradley J., Gichoya, Judy W.

Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical Imaging Research

arXiv.org Artificial Intelligence, Nov-15-2023

Chest X-rays (CXR) are the most common medical imaging study and are used to diagnose multiple medical conditions. This study examines the impact of synthetic data supplementation, using diffusion models, on the performance of deep learning (DL) classifiers for CXR analysis. We employed three datasets: CheXpert, MIMIC-CXR, and Emory Chest X-ray, training conditional denoising diffusion probabilistic models (DDPMs) to generate synthetic frontal radiographs. Our approach ensured that synthetic images mirrored the demographic and pathological traits of the original data. Evaluating the classifiers' performance on internal and external datasets revealed that synthetic data supplementation enhances model accuracy, particularly in detecting less prevalent pathologies. Furthermore, models trained on synthetic data alone approached the performance of those trained on real data. This suggests that synthetic data can potentially compensate for real data shortages in training robust DL models. However, despite promising outcomes, the superiority of real data persists.

supp, synth 0, synthetic data, (13 more...)

2311.09402

Country:

North America > United States > Minnesota > Olmsted County > Rochester (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
North America > United States > Montana (0.04)
(7 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Belcák, Peter, Kastrati, Ard, Schenker, Flavio, Wattenhofer, Roger

FACT: Learning Governing Abstractions Behind Integer Sequences

arXiv.org Artificial Intelligence, Sep-20-2022

Integer sequences are of central importance to the modeling of concepts admitting complete finitary descriptions. We introduce a novel view on the learning of such concepts and lay down a set of benchmarking tasks aimed at conceptual understanding by machine learning models. These tasks indirectly assess model ability to abstract, and challenge them to reason both interpolatively and extrapolatively from the knowledge gained by observing representative examples. To further aid research in knowledge representation and reasoning, we present FACT, the Finitary Abstraction Comprehension Toolkit.

artificial intelligence, machine learning, natural language, (19 more...)